Gene Expression - Introduction (GPC)


The genetic content of each somatic cell in an organism is the same, but
not all genes are expressed in every cell. The control of which genes are
expressed dictates whether a cell is (a) an eye cell or (b) a liver cell.
It is the differential gene expression patterns that arise in different
cells that give rise to (c) a complete organism.


Each somatic cell in the body generally contains the same DNA. A few 
exceptions include red blood cells, which contain no DNA in their mature 
state, and some immune system cells that rearrange their DNA while 
producing antibodies. In general, however, the genes that determine 
whether you have green eyes, brown hair, and how fast you metabolize food 
are the same in the cells in your eyes and your liver, even though these 
organs function quite differently. If each cell has the same DNA, how is it 
that cells or organs are different? Why do cells in the eye differ so 
dramatically from cells in the liver? 


Whereas each cell shares the same genome and DNA sequence, each cell 
does not turn on, or express, the same set of genes. Each cell type needs a 
different set of proteins to perform its function. Therefore, only a small 
subset of proteins is expressed in a cell. For the proteins to be expressed, 
the DNA must be transcribed into RNA and the RNA must be translated 
into protein. In a given cell type, not all genes encoded in the DNA are 
transcribed into RNA or translated into protein because specific cells in our 
body have specific functions. Specialized proteins that make up the eye 
(iris, lens, and cornea) are only expressed in the eye, whereas the 
specialized proteins in the heart (pacemaker cells, heart muscle, and valves)
are only expressed in the heart. At any given time, only a subset of all of the
genes encoded by our DNA are expressed and translated into proteins. The 
expression of specific genes is a highly regulated process with many levels 
and stages of control. This complexity ensures the proper expression in the 
proper cell at the proper time. 
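
The idea that every cell carries the same genes but expresses only a subset can be sketched in a few lines of Python. This is an illustration only: the gene names and the cell-type assignments are hypothetical placeholders, not real expression data.

```python
# Hypothetical genome shared by every somatic cell (names are illustrative only).
GENOME = {"lens_protein", "liver_enzyme", "pacemaker_channel", "ribosomal_protein"}

# Hypothetical expression programs: which genes each cell type switches on.
EXPRESSION_PROGRAM = {
    "eye": {"lens_protein", "ribosomal_protein"},
    "liver": {"liver_enzyme", "ribosomal_protein"},
    "heart": {"pacemaker_channel", "ribosomal_protein"},
}


def expressed_genes(cell_type):
    """Return the subset of the shared genome that this cell type expresses."""
    program = EXPRESSION_PROGRAM[cell_type]
    # A cell can only express genes that are present in its genome.
    return GENOME & program


def silent_genes(cell_type):
    """Return genes present in the DNA but not expressed in this cell type."""
    return GENOME - expressed_genes(cell_type)


for cell_type in EXPRESSION_PROGRAM:
    print(cell_type, sorted(expressed_genes(cell_type)))
```

Every cell type in the sketch draws on the same GENOME set; only the expression program differs, which mirrors the point made in the text.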


Gene Expression - Regulation (GPC) 
By the end of this section, you will be able to: 


• Discuss why every cell does not express all of its genes

• Describe how prokaryotic gene regulation occurs at the transcriptional
level

• Discuss how eukaryotic gene regulation occurs at the epigenetic,
transcriptional, post-transcriptional, translational, and
post-translational levels


For a cell to function properly, necessary proteins must be synthesized at 
the proper time. All cells control or regulate the synthesis of proteins from 
information encoded in their DNA. The process of turning on a gene to 
produce RNA and protein is called gene expression. Whether in a simple 
unicellular organism or a complex multi-cellular organism, each cell 
controls when and how its genes are expressed. For this to occur, there must 
be a mechanism to control when a gene is expressed to make RNA and 
protein, how much of the protein is made, and when it is time to stop 
making that protein because it is no longer needed. 


The regulation of gene expression conserves energy and space. It would 
require a significant amount of energy for an organism to express every 
gene at all times, so it is more energy efficient to turn on the genes only 
when they are required. In addition, only expressing a subset of genes in
each cell saves space because DNA must be unwound from its tightly
coiled structure for its genes to be transcribed and translated. Cells would
have to be enormous if every protein were expressed in every cell all the time.


The control of gene expression is extremely complex. Malfunctions in this 
process are detrimental to the cell and can lead to the development of many 
diseases, including cancer. 


Prokaryotic versus Eukaryotic Gene Expression 


To understand how gene expression is regulated, we must first understand 
how a gene codes for a functional protein in a cell. The process occurs in 
both prokaryotic and eukaryotic cells, just in slightly different manners. 


Prokaryotic organisms are single-celled organisms that lack a cell nucleus, 
and their DNA therefore floats freely in the cell cytoplasm. To synthesize a 
protein, the processes of transcription and translation occur almost 
simultaneously. When the resulting protein is no longer needed, 
transcription stops. As a result, the primary method to control what type of 
protein and how much of each protein is expressed in a prokaryotic cell is 
the regulation of DNA transcription. All of the subsequent steps occur 
automatically. When more protein is required, more transcription occurs. 
Therefore, in prokaryotic cells, the control of gene expression is mostly at 
the transcriptional level. 


Eukaryotic cells, in contrast, have intracellular organelles that add to their 
complexity. In eukaryotic cells, the DNA is contained inside the cell’s 
nucleus and there it is transcribed into RNA. The newly synthesized RNA is 
then transported out of the nucleus into the cytoplasm, where ribosomes 
translate the RNA into protein. The processes of transcription and 
translation are physically separated by the nuclear membrane; transcription 
occurs only within the nucleus, and translation occurs only outside the 
nucleus in the cytoplasm. The regulation of gene expression can occur at all 
stages of the process ([link]). Regulation may occur when the DNA is
uncoiled and loosened from nucleosomes to bind transcription factors 
(epigenetic level), when the RNA is transcribed (transcriptional level), 
when the RNA is processed and exported to the cytoplasm after it is 
transcribed (post-transcriptional level), when the RNA is translated into 
protein (translational level), or after the protein has been made
(post-translational level).

Prokaryotic transcription and translation occur simultaneously
in the cytoplasm, and regulation occurs at the transcriptional
level. Eukaryotic gene expression is regulated during
transcription and RNA processing, which take place in the
nucleus, and during protein translation, which takes place in
the cytoplasm. Further regulation may occur through
post-translational modifications of proteins.


The differences in the regulation of gene expression between prokaryotes 
and eukaryotes are summarized in [link]. The regulation of gene expression 
is discussed in detail in subsequent modules. 


Differences in the Regulation of Gene Expression of Prokaryotic
and Eukaryotic Organisms

Prokaryotic organisms            Eukaryotic organisms

Lack nucleus                     Contain nucleus

DNA is found in the cytoplasm    DNA is confined to the nuclear compartment

RNA transcription and protein    RNA transcription occurs prior to protein
formation occur almost           formation, and it takes place in the nucleus.
simultaneously                   Translation of RNA to protein occurs in the
                                 cytoplasm.

Gene expression is regulated     Gene expression is regulated at many levels
primarily at the                 (epigenetic, transcriptional, nuclear
transcriptional level            shuttling, post-transcriptional, translational,
                                 and post-translational)

Note: 


Evolution Connection 

Evolution of Gene Regulation 

Prokaryotic cells can only regulate gene expression by controlling the 
amount of transcription. As eukaryotic cells evolved, the complexity of the 
control of gene expression increased. For example, with the evolution of 
eukaryotic cells came compartmentalization of important cellular 
components and cellular processes. A nuclear region that contains the 
DNA was formed. Transcription and translation were physically separated 
into two different cellular compartments. It therefore became possible to 
control gene expression by regulating transcription in the nucleus, and also 
by controlling the RNA levels and protein translation present outside the 
nucleus. 


Some cellular processes arose from the need of the organism to defend 
itself. Cellular processes such as gene silencing developed to protect the 
cell from viral or parasitic infections. If the cell could quickly shut off gene 
expression for a short period of time, it would be able to survive an 
infection when other organisms could not. Therefore, the organism evolved 
a new process that helped it survive, and it was able to pass this new 
development to offspring. 


Section Summary 


While all somatic cells within an organism contain the same DNA, not all 
cells within that organism express the same proteins. Prokaryotic organisms 
express the entire DNA they encode in every cell, but not necessarily all at 
the same time. Proteins are expressed only when they are needed. 
Eukaryotic organisms express a subset of the DNA that is encoded in any 
given cell. In each cell type, the type and amount of protein is regulated by 
controlling gene expression. To express a protein, the DNA is first 
transcribed into RNA, which is then translated into proteins. In prokaryotic 
cells, these processes occur almost simultaneously. In eukaryotic cells, 
transcription occurs in the nucleus and is separate from the translation that 
occurs in the cytoplasm. Gene expression in prokaryotes is regulated primarily at
the transcriptional level, whereas in eukaryotic cells, gene expression is
regulated at the epigenetic, transcriptional, post-transcriptional, 
translational, and post-translational levels. 


Glossary 


epigenetic 
heritable changes that do not involve changes in the DNA sequence 


gene expression 
processes that control the turning on or turning off of a gene 


post-transcriptional
control of gene expression after the RNA molecule has been created
but before it is translated into protein


post-translational 
control of gene expression after a protein has been created 


Gene Expression - Prokaryotic Gene Regulation 
By the end of this section, you will be able to: 


• Describe the steps involved in prokaryotic gene regulation
• Explain the roles of activators, inducers, and repressors in gene
regulation


The DNA of prokaryotes is organized into a circular chromosome 
supercoiled in the nucleoid region of the cell cytoplasm. Proteins that are 
needed for a specific function, or that are involved in the same biochemical 
pathway, are encoded together in blocks called operons. For example, all of 
the genes needed to use lactose as an energy source are coded next to each 
other in the lactose (or lac) operon. 


In prokaryotic cells, there are three types of regulatory molecules that can 
affect the expression of operons: repressors, activators, and inducers. 
Repressors are proteins that suppress transcription of a gene in response to 
an external stimulus, whereas activators are proteins that increase the 
transcription of a gene in response to an external stimulus. Finally, inducers 
are small molecules that either activate or repress transcription depending 
on the needs of the cell and the availability of substrate. 


The trp Operon: A Repressor Operon 


Bacteria such as E. coli need amino acids to survive. Tryptophan is one 
such amino acid that E. coli can ingest from the environment. E. coli can 
also synthesize tryptophan using enzymes that are encoded by five genes. 
These five genes are next to each other in what is called the tryptophan 
(trp) operon ([link]). If tryptophan is present in the environment, then E.
coli does not need to synthesize it and the switch controlling the activation 
of the genes in the trp operon is switched off. However, when tryptophan 
availability is low, the switch controlling the operon is turned on, 
transcription is initiated, the genes are expressed, and tryptophan is 
synthesized. 


[Figure: the trp operon, with labels for the operator, the trp genes,
RNA polymerase, and tryptophan. First panel: When tryptophan is present,
the trp repressor binds the operator, and RNA synthesis is blocked.
Second panel: In the absence of tryptophan, the repressor dissociates
from the operator, and RNA synthesis proceeds.]


The five genes that are needed to synthesize 
tryptophan in E. coli are located next to each other 
in the trp operon. When tryptophan is plentiful, 
two tryptophan molecules bind the repressor 
protein at the operator sequence. This physically 
blocks the RNA polymerase from transcribing the 
tryptophan genes. When tryptophan is absent, the 
repressor protein does not bind to the operator and 
the genes are transcribed. 


A DNA sequence that codes for proteins is referred to as the coding region. 
The five coding regions for the tryptophan biosynthesis enzymes are 
arranged sequentially on the chromosome in the operon. Just before the 
coding region is the transcriptional start site. This is the region of DNA to 
which RNA polymerase binds to initiate transcription. The promoter 
sequence is upstream of the transcriptional start site; each operon has a 
sequence within or near the promoter to which proteins (activators or 
repressors) can bind and regulate transcription. 


A DNA sequence called the operator sequence is encoded between the 
promoter region and the first trp coding gene. This operator contains the
DNA code to which the repressor protein can bind. When tryptophan is
present in the cell, two tryptophan molecules bind to the trp repressor,
which changes shape to bind to the trp operator. Binding of the
tryptophan-repressor complex at the operator physically prevents the RNA
polymerase from binding and transcribing the downstream genes.


When tryptophan is not present in the cell, the repressor by itself does not 
bind to the operator; therefore, the operon is active and tryptophan is 
synthesized. Because the repressor protein actively binds to the operator to 
keep the genes turned off, the trp operon is negatively regulated and the 
proteins that bind to the operator to silence trp expression are negative 
regulators. 
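
The repressor logic described above can be sketched as a small Boolean model. The function names are hypothetical, and the sketch reduces the tryptophan level to a simple present/absent flag, which is a simplification of the graded response in a real cell.

```python
def trp_repressor_binds_operator(tryptophan_present):
    """The trp repressor binds the operator only when tryptophan is bound to it."""
    return tryptophan_present


def trp_operon_transcribed(tryptophan_present):
    """The operon is transcribed whenever the operator is free of repressor."""
    return not trp_repressor_binds_operator(tryptophan_present)


for tryptophan_present in (True, False):
    state = "on" if trp_operon_transcribed(tryptophan_present) else "off"
    print(f"tryptophan present={tryptophan_present}: trp operon {state}")
```

The single negation in trp_operon_transcribed is the whole point of negative regulation: the signal (tryptophan) acts by enabling a protein that switches the genes off.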


Note: 
Link to Learning 
Watch this video to learn more about the trp operon.


Catabolite Activator Protein (CAP): An Activator Regulator 


Just as the trp operon is negatively regulated by tryptophan molecules, there
are proteins that bind to the operator sequences and act as positive
regulators to turn genes on and activate them. For example, when glucose is
scarce, E. coli bacteria can turn to other sugar sources for fuel. To do this,
new genes to process these alternate sugars must be transcribed. When
glucose levels drop, cyclic AMP (cAMP) begins to accumulate in the cell.
The cAMP molecule is a signaling molecule that is involved in glucose and
energy metabolism in E. coli. When glucose levels decline in the cell,
accumulating cAMP binds to the positive regulator catabolite activator 
protein (CAP), a protein that binds to the promoters of operons that control 
the processing of alternative sugars. When cAMP binds to CAP, the 
complex binds to the promoter region of the genes that are needed to use 
the alternate sugar sources ([link]). In these operons, a CAP binding site is 
located upstream of the RNA polymerase binding site in the promoter. This 
increases the binding ability of RNA polymerase to the promoter region and 
the transcription of the genes. 


[Figure: CAP regulation of the lac operon (lacZ, lacY), with RNA
polymerase shown at the promoter. First panel: In the absence of cAMP,
CAP does not bind the promoter. Transcription occurs at a low rate.
Second panel: In the presence of cAMP, CAP binds the promoter and
increases RNA polymerase activity.]


When glucose levels fall, E. coli may use other 
sugars for fuel but must transcribe new genes to do 
so. As glucose supplies become limited, cAMP 
levels increase. This cAMP binds to the CAP 
protein, a positive regulator that binds to an 
operator region upstream of the genes required to 
use other sugar sources. 


The lac Operon: An Inducer Operon 


The third type of gene regulation in prokaryotic cells occurs through 
inducible operons, which have proteins that bind to activate or repress 
transcription depending on the local environment and the needs of the cell. 
The lac operon is a typical inducible operon. As mentioned previously, E. 
coli is able to use other sugars as energy sources when glucose 
concentrations are low. To do so, the cAMP-CAP protein complex serves
as a positive regulator to induce transcription. One such sugar source is 
lactose. The lac operon encodes the genes necessary to acquire and process 
the lactose from the local environment. CAP binds to the operator sequence 
upstream of the promoter that initiates transcription of the lac operon. 
However, for the lac operon to be activated, two conditions must be met. 
First, the level of glucose must be very low or non-existent. Second, lactose 
must be present. Only when glucose is absent and lactose is present will the 
lac operon be transcribed ([link]). This makes sense for the cell, because it
would be energetically wasteful to create the proteins to process lactose if
glucose were plentiful or lactose were not available.


Note: 
Art Connection 


[Figure: regulation of the lac operon, with labels for the operator, lacZ,
RNA polymerase, the repressor, and lactose. Panel text:
In the absence of lactose, the lac repressor binds the operator, and
transcription is blocked.
In the presence of lactose, the lac repressor is released from the
operator, and transcription proceeds at a slow rate.
cAMP-CAP complex stimulates RNA polymerase activity and increases RNA
synthesis.
However, even in the presence of cAMP-CAP complex, RNA synthesis is
blocked when repressor is bound to the operator.]

Transcription of the lac operon 
is carefully regulated so that its 
expression only occurs when 
glucose is limited and lactose is 
present to serve as an 
alternative fuel source. 


In E. coli, the trp operon is on by default, while the lac operon is off. Why 
do you think this is the case? 


If glucose is absent, then CAP can bind to the operator sequence to activate
transcription. If lactose is absent, then the repressor binds to the operator to
prevent transcription. If either requirement is not met (glucose is present
or lactose is absent), then transcription is blocked or stays at a low level.
Only when both conditions are satisfied is the lac operon fully transcribed
([link]).


Signals that Induce or Repress Transcription of the lac Operon

Glucose   CAP binds   Lactose   Repressor binds   Transcription
+         -           -         +                 No
+         -           +         -                 Some
-         +           -         +                 No
-         +           +         -                 Yes
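
The two conditions described above can also be written as a small Boolean model. This is a sketch under simplifying assumptions: glucose and lactose are treated as present/absent flags, the function name is hypothetical, and the three transcription levels ("No", "Some", "Yes") are labels taken from the text rather than measured rates.

```python
def lac_operon_state(glucose_present, lactose_present):
    """Return (cap_binds, repressor_binds, transcription) for the lac operon."""
    # cAMP accumulates only when glucose is scarce, so CAP binds only then.
    cap_binds = not glucose_present
    # The lac repressor stays on the operator unless lactose is present.
    repressor_binds = not lactose_present

    if repressor_binds:
        transcription = "No"      # operator blocked, even if CAP is bound
    elif cap_binds:
        transcription = "Yes"     # repressor released and CAP stimulating
    else:
        transcription = "Some"    # repressor released but no CAP stimulation
    return cap_binds, repressor_binds, transcription


for glucose in (True, False):
    for lactose in (False, True):
        cap, rep, level = lac_operon_state(glucose, lactose)
        print(f"glucose={glucose!s:5} lactose={lactose!s:5} "
              f"CAP binds={cap!s:5} repressor binds={rep!s:5} -> {level}")
```

Checking the repressor first reflects the statement that RNA synthesis is blocked whenever the repressor is bound, even if the cAMP-CAP complex is present.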


Note: 
Link to Learning 




Watch an animated tutorial about the workings of lac operon here. 


Section Summary 


The regulation of gene expression in prokaryotic cells occurs at the 
transcriptional level. There are three ways to control the transcription of an 
operon: repressive control, activator control, and inducible control. 
Repressive control, typified by the trp operon, uses proteins bound to the 
operator sequence to physically prevent the binding of RNA polymerase 
and the activation of transcription. Therefore, if tryptophan is not needed, 
the repressor is bound to the operator and transcription remains off. 
Activator control, typified by the action of CAP, increases the binding 
ability of RNA polymerase to the promoter when CAP is bound. In this 
case, low levels of glucose result in the binding of cAMP to CAP. CAP then 
binds the promoter, which allows RNA polymerase to bind to the promoter 
better. In the last example—the lac operon—two conditions must be met to 
initiate transcription. Glucose must not be present, and lactose must be 
available for the lac operon to be transcribed. If glucose is absent, CAP 
binds to the operator. If lactose is present, the repressor protein does not 
bind to its operator. Only when both conditions are met will RNA 
polymerase bind to the promoter to induce transcription. 


Art Connections 


Exercise: 


Problem: 


[link] In E. coli, the trp operon is on by default, while the lac operon is 
off. Why do you think that this is the case? 


Solution: 


[link] Tryptophan is an amino acid essential for making proteins, so 
the cell always needs to have some on hand. However, if plenty of 
tryptophan is present, it is wasteful to make more, and the expression 
of the trp operon is repressed. Lactose, a sugar found in milk, is not
always available. It makes no sense to make the enzymes necessary to 
digest an energy source that is not available, so the lac operon is only 
turned on when lactose is present. 


Glossary 


activator 
protein that binds to prokaryotic operators to increase transcription 


catabolite activator protein (CAP) 
protein that complexes with cAMP to bind to the promoter sequences 
of operons that control sugar processing when glucose is not available 


inducible operon 
operon that can be activated or repressed depending on cellular needs 
and the surrounding environment 


lac operon 
operon in prokaryotic cells that encodes genes required for processing 
and intake of lactose 


negative regulator 
protein that prevents transcription 


operator 
region of DNA outside of the promoter region that binds activators or 
repressors that control gene expression in prokaryotic cells 


operon 
collection of genes involved in a pathway that are transcribed together 
as a single mRNA in prokaryotic cells


positive regulator 
protein that increases transcription 


repressor 
protein that binds to the operator of prokaryotic genes to prevent 
transcription 


transcriptional start site
site at which transcription begins


trp operon 
series of genes necessary to synthesize tryptophan in prokaryotic cells 


tryptophan 
amino acid that can be synthesized by prokaryotic cells when 
necessary 


Gene Expression - Eukaryotic Transcriptional Regulation (GPC) 
By the end of this section, you will be able to: 


• Discuss the role of transcription factors in gene regulation
• Explain how enhancers and repressors regulate gene expression


As in prokaryotic cells, the transcription of genes in eukaryotes requires an
RNA polymerase to bind to a sequence upstream of a gene to
initiate transcription. However, unlike prokaryotic cells, the eukaryotic 
RNA polymerase requires other proteins, or transcription factors, to 
facilitate transcription initiation. Transcription factors are proteins that bind 
to the promoter sequence and other regulatory sequences to control the 
transcription of the target gene. RNA polymerase by itself cannot initiate 
transcription in eukaryotic cells. Transcription factors must bind to the 
promoter region first and recruit RNA polymerase to the site for 
transcription to be established. 


Note: 
Link to Learning 
View the process of transcription, the making of RNA from a DNA
template, at this site.


The Promoter and the Transcription Machinery 


Genes are organized to make the control of gene expression easier. The 
promoter region is immediately upstream of the coding sequence. This 
region can be short (only a few nucleotides in length) or quite long
(hundreds of nucleotides long). The longer the promoter, the more available
space for proteins to bind. This also adds more control to the transcription 
process. The length of the promoter is gene-specific and can differ 
dramatically between genes. Consequently, the level of control of gene 
expression can also differ quite dramatically between genes. The purpose of 
the promoter is to bind transcription factors that control the initiation of 
transcription. 


Within the promoter region, just upstream of the transcriptional start site, 
resides the TATA box. This box is simply a repeat of thymine and adenine 
dinucleotides (literally, TATA repeats). RNA polymerase binds to the 
transcription initiation complex, allowing transcription to occur. To initiate 
transcription, a transcription factor (TFIID) is the first to bind to the TATA 
box. Binding of TFIID recruits other transcription factors, including TFIIB, 
TFIIE, TFIIF, and TFIIH to the TATA box. Once this complex is assembled, 
RNA polymerase can bind to its upstream sequence. When bound along 
with the transcription factors, RNA polymerase is phosphorylated. This 
releases part of the protein from the DNA to activate the transcription 
initiation complex and places RNA polymerase in the correct orientation to 
begin transcription; DNA-bending protein brings the enhancer, which can 
be quite a distance from the gene, in contact with transcription factors and 
mediator proteins ([link]).


[Figure labels: DNA-bending protein, enhancer, distal control elements,
transcription factors and mediator proteins, activators, RNA polymerase]


An enhancer is a DNA sequence 
that promotes transcription. Each 
enhancer is made up of short DNA 
sequences called distal control 
elements. Activators bound to the 
distal control elements interact 
with mediator proteins and 
transcription factors. Two different 
genes may have the same 
promoter but different distal 
control elements, enabling 
differential gene expression. 


In addition to the general transcription factors, other transcription factors 
can bind to the promoter to regulate gene transcription. These transcription 
factors bind to the promoters of a specific set of genes. They are not general 
transcription factors that bind to every promoter complex, but are recruited 
to a specific sequence on the promoter of a specific gene. There are
hundreds of transcription factors in a cell that each bind specifically to a
particular DNA sequence motif. When a transcription factor binds to a site in
the promoter just upstream of the encoded gene, that site is referred to as a
cis-acting element, because it is on the same chromosome just next to the gene. The
region that a particular transcription factor binds to is called the 
transcription factor binding site. Transcription factors respond to 
environmental stimuli that cause the proteins to find their binding sites and 
initiate transcription of the gene that is needed. 


Enhancers and Transcription 


In some eukaryotic genes, there are regions that help increase or enhance 
transcription. These regions, called enhancers, are not necessarily close to 
the genes they enhance. They can be located upstream of a gene, within the 
coding region of the gene, downstream of a gene, or may be thousands of 
nucleotides away. 


Enhancer regions are binding sequences, or sites, for transcription factors. 
When a DNA-bending protein binds, the shape of the DNA changes 
([link]). This shape change allows for the interaction of the activators bound
to the enhancers with the transcription factors bound to the promoter region 
and the RNA polymerase. Whereas DNA is generally depicted as a straight 
line in two dimensions, it is actually a three-dimensional object. Therefore, 
a nucleotide sequence thousands of nucleotides away can fold over and 
interact with a specific promoter. 


Turning Genes Off: Transcriptional Repressors 


Like prokaryotic cells, eukaryotic cells also have mechanisms to prevent 
transcription. Transcriptional repressors can bind to promoter or enhancer 
regions and block transcription. Like the transcriptional activators, 
repressors respond to external stimuli to prevent the binding of activating 
transcription factors. 


Section Summary 


To start transcription, general transcription factors, such as TFIID, TFIIH, 
and others, must first bind to the TATA box and recruit RNA polymerase to 
that location. The binding of additional regulatory transcription factors to 
cis-acting elements will either increase or prevent transcription. In addition 
to promoter sequences, enhancer regions help augment transcription. 
Enhancers can be upstream, downstream, within a gene itself, or on other 
chromosomes. Transcription factors bind to enhancer regions to increase or 
prevent transcription. 


Glossary 


cis-acting element 
transcription factor binding sites within the promoter that regulate the 
transcription of a gene adjacent to it 


enhancer 
segment of DNA that is upstream, downstream, perhaps thousands of 
nucleotides away, or on another chromosome that influences the
transcription of a specific gene 


trans-acting element 
transcription factor binding site found outside the promoter or on 
another chromosome that influences the transcription of a particular 
gene 


transcription factor binding site 
sequence of DNA to which a transcription factor binds 


Eukaryotic Post-transcriptional Gene Regulation (GPC) 
By the end of this section, you will be able to: 


• Understand RNA splicing and explain its role in regulating gene
expression
• Describe the importance of RNA stability in gene regulation


RNA is transcribed, but must be processed into a mature form before 
translation can begin. This processing after an RNA molecule has been 
transcribed, but before it is translated into a protein, is called post- 
transcriptional modification. As with the epigenetic and transcriptional 
stages of processing, this post-transcriptional step can also be regulated to 
control gene expression in the cell. If the RNA is not processed, shuttled, or 
translated, then no protein will be synthesized. 


RNA splicing, the first stage of post-transcriptional control 


In eukaryotic cells, the RNA transcript often contains regions, called 
introns, that are removed prior to translation. The regions of RNA that code 
for protein are called exons ([link]). After an RNA molecule has been 
transcribed, but prior to its departure from the nucleus to be translated, the 
RNA is processed and the introns are removed by splicing. 




Pre-mRNA can be alternatively spliced to 
create different proteins. 


Note: 

Evolution Connection 

Alternative RNA Splicing 

In the 1970s, genes were first observed that exhibited alternative RNA 
splicing. Alternative RNA splicing is a mechanism that allows different 
protein products to be produced from one gene when different 
combinations of introns, and sometimes exons, are removed from the 
transcript ([link]). This alternative splicing can be haphazard, but more 
often it is controlled and acts as a mechanism of gene regulation, with the 
frequency of different splicing alternatives controlled by the cell as a way 
to control the production of different protein products in different cells or 
at different stages of development. Alternative splicing is now understood 
to be a common mechanism of gene regulation in eukaryotes; according to
one estimate, 70 percent of genes in humans are expressed as multiple 
proteins through alternative splicing. 


[Figure labels: exon skipping; mutually exclusive exons; alternative 5'
donor sites; alternative 3' acceptor sites; intron retention]


There are five basic modes of alternative 
splicing. 


How could alternative splicing evolve? Introns have a beginning and 
ending recognition sequence; it is easy to imagine the failure of the 
splicing mechanism to identify the end of an intron and instead find the 
end of the next intron, thus removing two introns and the intervening exon. 
In fact, there are mechanisms in place to prevent such intron skipping, but 
mutations are likely to lead to their failure. Such “mistakes” would more 
than likely produce a nonfunctional protein. Indeed, the cause of many 
genetic diseases is alternative splicing rather than mutations in a sequence. 
However, alternative splicing would create a protein variant without the 
loss of the original protein, opening up possibilities for adaptation of the 
new variant to new functions. Gene duplication has played an important 
role in the evolution of new functions in a similar way by providing genes 
that may evolve without eliminating the original, functional protein. 


Note: 
Link to Learning 
Visualize how mRNA splicing happens by watching the process in action 
in this video. 


Control of RNA Stability 


Before the mRNA leaves the nucleus, it is given two protective "caps" that 
prevent the end of the strand from degrading during its journey. The 5' cap, 
which is placed on the 5' end of the mRNA, is usually composed of a 
methylated guanosine triphosphate molecule (GTP). The poly-A tail, which 
is attached to the 3' end, is usually composed of a series of adenine 
nucleotides. Once the RNA is transported to the cytoplasm, the length of 
time that the RNA resides there can be controlled. Each RNA molecule has 
a defined lifespan and decays at a specific rate. This rate of decay can 
influence how much protein is in the cell. If the decay rate is increased, the 
RNA will not exist in the cytoplasm as long, shortening the time for 
translation to occur. Conversely, if the rate of decay is decreased, the RNA 
molecule will reside in the cytoplasm longer and more protein can be 
translated. This rate of decay is referred to as the RNA stability. If the RNA 
is stable, it will be detected for longer periods of time in the cytoplasm. 
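The link between decay rate and protein output can be made concrete with a small calculation. The sketch below assumes simple first-order (exponential) decay and a constant translation rate per mRNA; both assumptions, along with every number and function name, are illustrative and do not come from the text.

```python
import math


def mrna_remaining(initial_copies, half_life_min, t_min):
    """Copies of an mRNA left after t_min minutes of first-order decay."""
    decay_rate = math.log(2) / half_life_min
    return initial_copies * math.exp(-decay_rate * t_min)


def total_protein(initial_copies, half_life_min, proteins_per_mrna_per_min):
    """Total protein made before a pulse of mRNA has fully decayed.

    Integrating N0 * exp(-k * t) from 0 to infinity gives N0 / k, so the
    yield is proportional to the half-life (assumed model, not measured data).
    """
    decay_rate = math.log(2) / half_life_min
    return proteins_per_mrna_per_min * initial_copies / decay_rate


# Hypothetical transcripts: a stable one (60 min) and an unstable one (15 min).
stable = total_protein(100, half_life_min=60, proteins_per_mrna_per_min=2)
unstable = total_protein(100, half_life_min=15, proteins_per_mrna_per_min=2)
print(f"stable / unstable protein yield: {stable / unstable:.1f}")
```

Under these assumptions, a fourfold longer half-life gives a fourfold larger protein yield from the same number of transcripts.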


Binding of proteins to the RNA can influence its stability. Proteins, called 
RNA-binding proteins, or RBPs, can bind to the regions of the RNA just 
upstream or downstream of the protein-coding region. These regions in the 
RNA that are not translated into protein are called the untranslated 
regions, or UTRs. They are not introns (those have been removed in the 
nucleus). Rather, these are regions that regulate mRNA localization, 
stability, and protein translation. The region just before the protein-coding 
region is called the 5' UTR, whereas the region after the coding region is 
called the 3' UTR ([link]). The binding of RBPs to these regions can 
increase or decrease the stability of an RNA molecule, depending on the 
specific RBP that binds. 


RNA-binding proteins 


ogo mm ies 


5' cap poly-A tail 


The protein-coding region of mRNA is flanked by 
5' and 3' untranslated regions (UTRs). The 
presence of RNA-binding proteins at the 5' or 3' 
UTR influences the stability of the RNA molecule. 


RNA Stability and microRNAs 


In addition to RBPs that bind to and control (increase or decrease) RNA 
stability, other elements called microRNAs can bind to the RNA molecule. 
These microRNAs, or miRNAs, are short RNA molecules that are only 21 to 
24 nucleotides in length. The miRNAs are made in the nucleus as longer 
pre-miRNAs. These pre-miRNAs are chopped into mature miRNAs by a 
protein called dicer. Like transcription factors and RBPs, mature miRNAs 
recognize a specific sequence and bind to the RNA; however, miRNAs also 
associate with a ribonucleoprotein complex called the RNA-induced 
silencing complex (RISC). RISC binds along with the miRNA to degrade 
the target mRNA. Together, miRNAs and RISC rapidly destroy 
the RNA molecule. 
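Sequence-specific recognition by a miRNA can be sketched as a search for complementary sites. The example below assumes a common simplification that is not stated in the text: pairing is driven mainly by a short "seed" at nucleotides 2 to 8 of the miRNA. The sequences are made up, and real target prediction weighs many more features.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the RNA strand that would base-pair with the given strand."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def find_target_sites(mirna, utr):
    """Return 0-based positions in the UTR that pair with the miRNA seed.

    The seed is taken as nucleotides 2-8 of the miRNA (an assumption of
    this sketch); a site is a perfect match to its reverse complement.
    """
    seed = mirna[1:8]
    site = reverse_complement(seed)
    positions = []
    start = utr.find(site)
    while start != -1:
        positions.append(start)
        start = utr.find(site, start + 1)
    return positions


# Made-up 22-nucleotide miRNA and a made-up 3' UTR fragment.
mirna = "AUGGCUAACGUUCAGGAUCCUA"
utr = "GGCAUUAGCCAUCCGAAUUAGCCAGG"
sites = find_target_sites(mirna, utr)
print(sites)
```

In this toy example the UTR carries two seed-complementary sites, so the miRNA (together with RISC) would have two places to bind.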


Section Summary 


Post-transcriptional control can occur at any stage after transcription, 
including RNA splicing, nuclear shuttling, and RNA stability. Once RNA is 
transcribed, it must be processed to create a mature RNA that is ready to be 
translated. This involves the removal of introns that do not code for protein. 
Spliceosomes bind to the signals that mark the exon/intron border to 
remove the introns and ligate the exons together. Once this occurs, the RNA 
is mature and can be translated. RNA is created and spliced in the nucleus, 
but needs to be transported to the cytoplasm to be translated. RNA is 
transported to the cytoplasm through the nuclear pore complex. Once the 
RNA is in the cytoplasm, the length of time it resides there before being 
degraded, called RNA stability, can also be altered to control the overall 
amount of protein that is synthesized. The RNA stability can be increased, 
leading to longer residency time in the cytoplasm, or decreased, leading to 
shortened time and less protein synthesis. RNA stability is controlled by 
RNA-binding proteins (RBPs) and microRNAs (miRNAs). These RBPs and 
miRNAs bind to the 5' UTR or the 3' UTR of the RNA to increase or 
decrease RNA stability. Depending on the RBP, the stability can be 
increased or decreased significantly; however, miRNAs always decrease 
stability and promote decay. 


Glossary 


3' UTR 
3' untranslated region; region just downstream of the protein-coding 
region in an RNA molecule that is not translated 


5' cap 
a methylated guanosine triphosphate (GTP) molecule that is attached 
to the 5' end of a messenger RNA to protect the end from degradation 


5' UTR 
5' untranslated region; region just upstream of the protein-coding 
region in an RNA molecule that is not translated 


dicer 
enzyme that chops the pre-miRNA into the mature form of the miRNA 


microRNA (miRNA) 
small RNA molecules (approximately 21 nucleotides in length) that 
bind to RNA molecules to degrade them 


poly-A tail 
a series of adenine nucleotides that are attached to the 3' end of an 
mRNA to protect the end from degradation 


RNA-binding protein (RBP) 
protein that binds to the 3' or 5' UTR to increase or decrease the RNA 
stability 


RNA stability 
how long an RNA molecule will remain intact in the cytoplasm 


untranslated region 
segment of the RNA molecule that is not translated into protein. 
These regions lie before (upstream or 5') and after (downstream or 3') 
the protein-coding region 


RISC 
protein complex that binds along with the miRNA to the RNA to 
degrade it 


Eukaryotic Translational and Post-translational Gene Regulation (GPC) 
By the end of this section, you will be able to: 


• Understand the process of translation and discuss its key factors 
• Describe how the initiation complex controls translation 
• Explain the different ways in which the post-translational control of 
gene expression takes place 


After the RNA has been transported to the cytoplasm, it is translated into 
protein. Control of this process is largely dependent on the RNA molecule. 
As previously discussed, the stability of the RNA will have a large impact 
on its translation into a protein. As the stability changes, the amount of time 
that it is available for translation also changes. 


The Initiation Complex and Translation Rate 


Like transcription, translation is controlled by proteins that bind and initiate 
the process. In translation, the complex that assembles to start the process is 
referred to as the initiation complex. The first protein to bind to the RNA 
to initiate translation is the eukaryotic initiation factor-2 (eIF-2). The 
eIF-2 protein is active when it binds to the high-energy molecule guanosine 
triphosphate (GTP). GTP provides the energy to start the reaction by 
giving up a phosphate and becoming guanosine diphosphate (GDP). The 
eIF-2 protein bound to GTP binds to the small 40S ribosomal subunit. 
When bound, the methionine initiator tRNA associates with the eIF-2/40S 
ribosome complex, bringing along with it the mRNA to be translated. At 
this point, when the initiation complex is assembled, the GTP is converted 
into GDP and energy is released. The phosphate and the eIF-2 protein are 
released from the complex and the large 60S ribosomal subunit binds to 
translate the RNA. The binding of eIF-2 to the RNA is controlled by 
phosphorylation. If eIF-2 is phosphorylated, it undergoes a conformational 
change and cannot bind to GTP. Therefore, the initiation complex cannot 
form properly and translation is impeded ([link]). When eIF-2 remains 
unphosphorylated, it binds GTP, the initiation complex forms on the RNA, 
and translation proceeds. 
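The on/off logic described above can be written as a tiny state model. Everything in the sketch is an illustrative assumption: the class and function names are invented, and the linear relation between the phosphorylated fraction of eIF-2 and the overall translation rate is a deliberate oversimplification.

```python
from dataclasses import dataclass


@dataclass
class EIF2:
    """Toy representation of one eIF-2 molecule (hypothetical model)."""
    phosphorylated: bool = False

    def can_bind_gtp(self):
        # Per the text, phosphorylated eIF-2 cannot bind GTP.
        return not self.phosphorylated


def initiation_complex_forms(eif2, gtp_available=True):
    """The complex forms only if eIF-2 can bind GTP and GTP is present."""
    return gtp_available and eif2.can_bind_gtp()


def relative_translation_rate(fraction_phosphorylated, max_rate=1.0):
    """Assume the rate scales with the unphosphorylated share of eIF-2."""
    return max_rate * (1.0 - fraction_phosphorylated)


print(initiation_complex_forms(EIF2(phosphorylated=False)))  # complex forms
print(initiation_complex_forms(EIF2(phosphorylated=True)))   # complex blocked
print(relative_translation_rate(0.25))
```

The model only restates the rule in the paragraph: the more eIF-2 is phosphorylated, the fewer initiation complexes can assemble.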


Note: 
Art Connection 


When eIF-2 is phosphorylated, translation is blocked. When eIF-2 is not 
phosphorylated, translation occurs. 

Gene expression can be controlled by factors that bind the translation 
initiation complex. 


An increase in phosphorylation levels of eIF-2 has been observed in 
patients with neurodegenerative diseases such as Alzheimer’s, Parkinson’s, 
and Huntington’s. What impact do you think this might have on protein 
synthesis? 


Chemical Modifications, Protein Activity, and Longevity 


Proteins can be chemically modified with the addition of groups including 
methyl, phosphate, acetyl, and ubiquitin groups. The addition or removal of 
these groups from proteins regulates their activity or the length of time they 
exist in the cell. Sometimes these modifications can regulate where a 
protein is found in the cell—for example, in the nucleus, the cytoplasm, or 
attached to the plasma membrane. 


Chemical modifications occur in response to external stimuli such as stress, 
the lack of nutrients, heat, or ultraviolet light exposure. These changes can 
alter epigenetic accessibility, transcription, mRNA stability, or translation, 
all resulting in changes in expression of various genes. This is an efficient 
way for the cell to rapidly change the levels of specific proteins in response 
to the environment. Because proteins are involved in every stage of gene 
regulation, the phosphorylation of a protein (depending on the protein that 
is modified) can alter accessibility to the chromosome, can alter transcription 
(by altering transcription factor binding or function), can change nuclear 
shuttling (by influencing modifications to the nuclear pore complex), can 
alter RNA stability (by binding or not binding to the RNA to regulate its 
stability), can modify translation (increase or decrease), or can change 
post-translational modifications (add or remove phosphates or other chemical 
modifications). 


The addition of a ubiquitin group to a protein marks that protein for 
degradation. Ubiquitin acts like a flag indicating that the protein lifespan is 
complete. These proteins are moved to the proteasome, a large protein 
complex that functions to remove proteins, to be degraded ([link]). One way 
to control gene expression, therefore, is to alter the longevity of the protein. 




Proteins with ubiquitin tags are 
marked for degradation within the 
proteasome. 


Section Summary 


Changing the status of the RNA or the protein itself can affect the amount 
of protein, the function of the protein, or how long it is found in the cell. To 
translate the protein, a protein initiator complex must assemble on the RNA. 
Modifications (such as phosphorylation) of proteins in this complex can 
prevent proper translation from occurring. Once a protein has been 
synthesized, it can be modified (phosphorylated, acetylated, methylated, or 
ubiquitinated). These post-translational modifications can greatly impact 
the stability, degradation, or function of the protein. 


Art Connections 


Exercise: 


Problem: 


[link] An increase in phosphorylation levels of eIF-2 has been 
observed in patients with neurodegenerative diseases such as 
Alzheimer’s, Parkinson’s, and Huntington’s. What impact do you think 
this might have on protein synthesis? 


Solution: 


[link] Protein synthesis would be inhibited. 


Glossary 


eukaryotic initiation factor-2 (eIF-2) 
protein that binds first to an mRNA to initiate translation 


guanosine diphosphate (GDP) 
molecule that is left after the energy is used to start translation 


guanosine triphosphate (GTP) 
energy-providing molecule that binds to eIF-2 and is needed for 
translation 


initiation complex 
protein complex containing eIF-2 that starts translation 


large 60S ribosomal subunit 
second, larger ribosomal subunit that binds to the RNA to translate it 
into protein 


proteasome 
large protein complex that degrades proteins 


small 40S ribosomal subunit 
ribosomal subunit that binds to the RNA to translate it into protein 


Gene Expression - Epigenetic Gene Regulation (GPC) 
By the end of this section, you will be able to: 


• Explain the process of epigenetic regulation 
• Describe how access to DNA is controlled by histone modification 


Eukaryotic gene expression is more complex than prokaryotic gene 
expression because the processes of transcription and translation are 
physically separated. Unlike prokaryotic cells, eukaryotic cells can regulate 
gene expression at many different levels. Eukaryotic gene expression begins 
with control of access to the DNA. This form of regulation, called 
epigenetic regulation, occurs even before transcription is initiated. 


Epigenetic Control: Regulating Access to Genes within the 
Chromosome 


The human genome encodes over 20,000 genes; each of the 23 pairs of 
human chromosomes encodes thousands of genes. The DNA in the nucleus 
is precisely wound, folded, and compacted into chromosomes so that it will 
fit into the nucleus. It is also organized so that specific segments can be 
accessed as needed by a specific cell type. 


The first level of organization, or packing, is the winding of DNA strands 
around histone proteins. Histones package and order DNA into structural 
units called nucleosome complexes, which can control the access of 
proteins to the DNA regions ([link]a). Under the electron microscope, this 
winding of DNA around histone proteins to form nucleosomes looks like 
small beads on a string ([link]b). These beads (histone proteins) can move 
along the string (DNA) and change the structure of the molecule. 




DNA is folded around histone proteins to create (a) 
nucleosome complexes. These nucleosomes control the access 
of proteins to the underlying DNA. When viewed through an 
electron microscope (b), the nucleosomes look like beads on a 
string. (credit “micrograph”: modification of work by Chris 
Woodcock) 


If DNA encoding a specific gene is to be transcribed into RNA, the 
nucleosomes surrounding that region of DNA can slide down the DNA to 
open that specific chromosomal region and allow for the transcriptional 
machinery (RNA polymerase) to initiate transcription ([link]). Nucleosomes 
can move to open the chromosome structure to expose a segment of DNA, 
but do so in a very controlled manner. 


Note: 
Art Connection 


Top panel (methyl groups on the DNA and histone tails; DNA inaccessible, 
gene inactive): Methylation of DNA and histones causes nucleosomes to pack 
tightly together. Transcription factors cannot bind the DNA, and genes are 
not expressed. 

Bottom panel (acetyl groups on the histone tails; DNA accessible, gene 
active): Histone acetylation results in loose packing of nucleosomes. 
Transcription factors can bind the DNA and genes are expressed. 


Nucleosomes can slide along DNA. When 
nucleosomes are spaced closely together (top), 
transcription factors cannot bind and gene 
expression is turned off. When the nucleosomes 
are spaced far apart (bottom), the DNA is exposed. 
Transcription factors can bind, allowing gene 
expression to occur. Modifications to the histones 
and DNA affect nucleosome spacing. 


In females, one of the two X chromosomes is inactivated during embryonic 
development because of epigenetic changes to the chromatin. What impact 
do you think these changes would have on nucleosome packing? 


How the histone proteins move is dependent on signals found on both the 
histone proteins and on the DNA. These signals are tags added to histone 
proteins and DNA that tell the histones if a chromosomal region should be 
open or closed ([link] depicts modifications to histone proteins and DNA). 
These tags are not permanent, but may be added or removed as needed. 


They are chemical modifications (phosphate, methyl, or acetyl groups) that 
are attached to specific amino acids in the protein or to the nucleotides of 
the DNA. The tags do not alter the DNA base sequence, but they do alter 
how tightly wound the DNA is around the histone proteins. DNA is a 
negatively charged molecule; therefore, changes in the charge of the histone 
will change how tightly wound the DNA molecule will be. When 
unmodified, the histone proteins have a large positive charge; by adding 
chemical modifications like acetyl groups, the charge becomes less positive. 


The DNA molecule itself can also be modified. This occurs within very 
specific regions called CpG islands. These are stretches with a high 
frequency of cytosine and guanine dinucleotide DNA pairs (CG) found in 
the promoter regions of genes. When this configuration exists, the cytosine 
member of the pair can be methylated (a methyl group is added). This 
modification changes how the DNA interacts with proteins, including the 
histone proteins that control access to the region. Highly methylated 
(hypermethylated) DNA regions with deacetylated histones are tightly 
coiled and transcriptionally inactive. 
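A stretch of DNA can be screened for CpG-island character with a few counts. The sketch below uses a widely cited heuristic that is not part of this text (length of at least 200 bases, GC content above 50 percent, and an observed-to-expected CpG ratio above 0.6); the thresholds are therefore assumptions, and the sequences are artificial.

```python
def cpg_stats(seq):
    """Return (GC fraction, observed/expected CpG ratio) for a DNA string."""
    seq = seq.upper()
    n = len(seq)
    c_count = seq.count("C")
    g_count = seq.count("G")
    cpg_count = seq.count("CG")  # cytosine immediately followed by guanine
    gc_fraction = (c_count + g_count) / n if n else 0.0
    # Expected CpG count if C and G were placed independently of each other.
    expected = c_count * g_count / n if n else 0.0
    ratio = cpg_count / expected if expected else 0.0
    return gc_fraction, ratio


def looks_like_cpg_island(seq, min_length=200, min_gc=0.5, min_ratio=0.6):
    """Apply the assumed thresholds; defaults follow the common heuristic."""
    gc_fraction, ratio = cpg_stats(seq)
    return len(seq) >= min_length and gc_fraction > min_gc and ratio > min_ratio


# Artificial sequences: one CpG-rich, one with no C or G at all.
cpg_rich = "CG" * 100
at_rich = "AT" * 100
print(cpg_stats(cpg_rich), looks_like_cpg_island(cpg_rich))
print(cpg_stats(at_rich), looks_like_cpg_island(at_rich))
```

The check only flags candidate regions by sequence composition; whether the cytosines in such a region are actually methylated has to be measured experimentally.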


Epigenetic changes to the chromatin may result from development (in utero, 
childhood), environmental chemicals, drugs/pharmaceuticals, aging, and diet. 
Epigenetic changes may result in cancer, autoimmune disease, mental 
disorders, and diabetes. Histones are proteins around which DNA winds for 
compaction and gene regulation. 

Histone proteins and DNA nucleotides can be modified chemically. 
Modifications affect nucleosome spacing and gene expression. (credit: 
modification of work by NIH) 


This type of gene regulation is called epigenetic regulation. Epigenetic 
means “around genetics.” The changes that occur to the histone proteins 
and DNA do not alter the nucleotide sequence and are not permanent. 
Instead, these changes are temporary (although they often persist through 
multiple rounds of cell division) and alter the chromosomal structure (open 
or closed) as needed. A gene can be turned on or off depending upon the 
location and modifications to the histone proteins and DNA. If a gene is to 
be transcribed, the histone proteins and DNA are modified surrounding the 
chromosomal region encoding that gene. This opens the chromosomal 
region to allow access for RNA polymerase and other proteins, called 
transcription factors, to bind to the promoter region, located just upstream 
of the gene, and initiate transcription. If a gene is to remain turned off, or 
silenced, the histone proteins and DNA have different modifications that 
signal a closed chromosomal configuration. In this closed configuration, the 
RNA polymerase and transcription factors do not have access to the DNA 
and transcription cannot occur ([link]). 


Note: 
Link to Learning 
View this video that describes how epigenetic regulation controls gene 
expression. 


Section Summary 


In eukaryotic cells, the first stage of gene expression control occurs at the 
epigenetic level. Epigenetic mechanisms control access to the chromosomal 
region to allow genes to be turned on or off. These mechanisms control how 
DNA is packed into the nucleus by regulating how tightly the DNA is 
wound around histone proteins. The addition or removal of chemical 
modifications (or flags) to histone proteins or DNA signals to the cell to 
open or close a chromosomal region. Therefore, eukaryotic cells can control 
whether a gene is expressed by controlling accessibility to transcription 
factors and the binding of RNA polymerase to initiate transcription. 


Art Connections 


Exercise: 


Problem: 


[link] In females, one of the two X chromosomes is inactivated during 
embryonic development because of epigenetic changes to the 
chromatin. What impact do you think these changes would have on 
nucleosome packing? 


Solution: 


[link] The nucleosomes would pack more tightly together. 


Glossary 


transcription factor 
protein that binds to the DNA at the promoter or enhancer region and 
that influences transcription of a gene 


Cancer and Gene Regulation (GPC) 
By the end of this section, you will be able to: 


• Describe how changes to gene expression can cause cancer 
• Explain how changes to gene expression at different levels can disrupt 
the cell cycle 
• Discuss how understanding regulation of gene expression can lead to 
better drug design 


Cancer is not a single disease but includes many different diseases. In 
cancer cells, mutations modify cell-cycle control and cells don’t stop 
growing as they normally would. Mutations can also alter the growth rate or 
the progression of the cell through the cell cycle. One example of a gene 
modification that alters the growth rate is increased phosphorylation of 
cyclin B, a protein that controls the progression of a cell through the cell 
cycle and serves as a cell-cycle checkpoint protein. 


For cells to move through each phase of the cell cycle, the cell must pass 
through checkpoints. This ensures that the cell has properly completed the 
step and has not encountered any mutation that will alter its function. Many 
proteins, including cyclin B, control these checkpoints. The 
phosphorylation of cyclin B, a post-translational event, alters its function. 
As a result, cells can progress through the cell cycle unimpeded, even if 
mutations exist in the cell and its growth should be terminated. This post- 
translational change of cyclin B prevents it from controlling the cell cycle 
and contributes to the development of cancer. 


Cancer: Disease of Altered Gene Expression 


Cancer can be described as a disease of altered gene expression. There are 
many proteins that are turned on or off (gene activation or gene silencing) 
that dramatically alter the overall activity of the cell. A gene that is not 
normally expressed in that cell can be switched on and expressed at high 
levels. This can be the result of gene mutation or changes in gene regulation 
(epigenetic, transcription, post-transcription, translation, or post- 
translation). 


Changes in epigenetic regulation, transcription, RNA stability, protein 
translation, and post-translational control can be detected in cancer. While 
these changes don’t occur simultaneously in one cancer, changes at each of 
these levels can be detected when observing cancer at different sites in 
different individuals. Therefore, changes in histone acetylation (an epigenetic 
modification that alters access to genes), activation of transcription factors 
by phosphorylation, increased RNA stability, altered translational 
control, and protein modification can all be detected at some point in 
various cancer cells. Scientists are working to understand the common 
changes that give rise to certain types of cancer or how a modification 
might be exploited to destroy a tumor cell. 


Tumor Suppressor Genes, Oncogenes, and Cancer 


In normal cells, some genes function to prevent excess, inappropriate cell 
growth. These are tumor suppressor genes, which are active in normal cells 
to prevent uncontrolled cell growth. There are many tumor suppressor 
genes in cells. The most studied tumor suppressor gene is p53, which is 
mutated in over 50 percent of all cancer types. The p53 protein itself 
functions as a transcription factor. It can bind to sites in the promoters of 
genes to initiate transcription. Therefore, the mutation of p53 in cancer will 
dramatically alter the transcriptional activity of its target genes. 


Note: 
Link to Learning 
Watch this animation to learn more about the use of p53 in fighting cancer. 


Proto-oncogenes are positive cell-cycle regulators. When mutated, proto- 
oncogenes can become oncogenes and cause cancer. Overexpression of the 
oncogene can lead to uncontrolled cell growth. This is because oncogenes 
can alter transcriptional activity, stability, or protein translation of another 
gene that directly or indirectly controls cell growth. An example of an 
oncogene involved in cancer is a protein called myc. Myc is a transcription 
factor that is aberrantly activated in Burkitt's lymphoma, a cancer of the 
lymph system. Overexpression of myc transforms normal B cells into 
cancerous cells that continue to grow uncontrollably. High B-cell numbers 
can result in tumors that can interfere with normal bodily function. Patients 
with Burkitt's lymphoma can develop tumors on their jaw or in their mouth 
that interfere with the ability to eat. 


Cancer and Epigenetic Alterations 


Silencing genes through epigenetic mechanisms is also very common in 
cancer cells. There are characteristic modifications to histone proteins and 
DNA that are associated with silenced genes. In cancer cells, the DNA in 
the promoter region of silenced genes is methylated on cytosine DNA 
residues in CpG islands. Histone proteins that surround that region lack the 
acetylation modification that is present when the genes are expressed in 
normal cells. This combination of DNA methylation and histone 
deacetylation (epigenetic modifications that lead to gene silencing) is 
commonly found in cancer. When these modifications occur, the gene 
present in that chromosomal region is silenced. Increasingly, scientists 
understand how epigenetic changes are altered in cancer. Because these 
changes are temporary and can be reversed (for example, by blocking the 
action of the histone deacetylase protein that removes acetyl groups, or of 
the DNA methyl transferase enzymes that add methyl groups to cytosines in 
DNA), it is possible to design new drugs and new therapies to take 
advantage of the reversible nature of these processes. Indeed, many 
researchers are testing how a silenced gene can be switched back on in a 
cancer cell to help re-establish normal growth patterns. 


Genes involved in the development of many other illnesses, ranging from 
allergies to inflammation to autism, are thought to be regulated by 
epigenetic mechanisms. As our knowledge of how genes are controlled 
deepens, new ways to treat diseases like cancer will emerge. 


Cancer and Transcriptional Control 


Alterations in cells that give rise to cancer can affect the transcriptional 
control of gene expression. Mutations that activate transcription factors, 
such as increased phosphorylation, can increase the binding of a 
transcription factor to its binding site in a promoter. This could lead to 
increased transcriptional activation of that gene that results in modified cell 
growth. Alternatively, a mutation in the DNA of a promoter or enhancer 
region can increase the binding ability of a transcription factor. This could 
also lead to the increased transcription and aberrant gene expression that is 
seen in cancer cells. 


Researchers have been investigating how to control the transcriptional 
activation of gene expression in cancer. Identifying how a transcription 
factor binds, or a pathway whose activation can be blocked to turn a gene off, 
has led to new drugs and new ways to treat cancer. In breast cancer, for 
example, many proteins are overexpressed. This can lead to increased 
phosphorylation of key transcription factors that increase transcription. One 
such example is the overexpression of the epidermal growth factor receptor 
(EGFR) in a subset of breast cancers. The EGFR pathway activates many 
protein kinases that, in turn, activate many transcription factors that control 
genes involved in cell growth. New drugs that prevent the activation of 
EGFR have been developed and are used to treat these cancers. 


Cancer and Post-transcriptional Control 


Changes in the post-transcriptional control of a gene can also result in 
cancer. Recently, several groups of researchers have shown that specific 
cancers have altered expression of miRNAs. Because miRNAs bind to the 
3' UTR of RNA molecules to degrade them, overexpression of these 
miRNAs could be detrimental to normal cellular activity. Too many 
miRNAs could dramatically decrease the RNA population leading to a 
decrease in protein expression. Several studies have demonstrated a change 
in the miRNA population in specific cancer types. It appears that the subset 
of miRNAs expressed in breast cancer cells is quite different from the 
subset expressed in lung cancer cells or even from normal breast cells. This 
suggests that alterations in miRNA activity can contribute to the growth of 
breast cancer cells. These types of studies also suggest that if some 
miRNAs are specifically expressed only in cancer cells, they could be 
potential drug targets. It would, therefore, be conceivable that new drugs 
that turn off miRNA expression in cancer could be an effective method to 
treat cancer. 


Cancer and Translational/Post-translational Control 


There are many examples of how translational or post-translational 
modifications of proteins arise in cancer. Modifications are found in cancer 
cells from the increased translation of a protein to changes in protein 
phosphorylation to alternative splice variants of a protein. An example of 
how the expression of an alternative form of a protein can have 
dramatically different outcomes is seen in colon cancer cells. The c-Flip 
protein, a protein involved in mediating the cell death pathway, comes in 
two forms: long (c-FLIPL) and short (c-FLIPS). Both forms appear to be 
involved in initiating controlled cell death mechanisms in normal cells. 
However, in colon cancer cells, expression of the long form results in 
increased cell growth instead of cell death. Clearly, the expression of the 
wrong protein dramatically alters cell function and contributes to the 
development of cancer. 


New Drugs to Combat Cancer: Targeted Therapies 


Scientists are using what is known about the regulation of gene expression 
in disease states, including cancer, to develop new ways to treat and prevent 
disease development. Many scientists are designing drugs on the basis of 
the gene expression patterns within individual tumors. This idea, that 
therapy and medicines can be tailored to an individual, has given rise to the 
field of personalized medicine. With an increased understanding of gene 
regulation and gene function, medicines can be designed to specifically 
target diseased cells without harming healthy cells. Some new medicines, 
called targeted therapies, have exploited the overexpression of a specific 
protein or the mutation of a gene to develop a new medication to treat 
disease. One such example is the use of anti-EGF receptor medications to 
treat the subset of breast cancer tumors that have very high levels of the 
EGF protein. Undoubtedly, more targeted therapies will be developed as 
scientists learn more about how gene expression changes can cause cancer. 


Note: 

Career Connection 

Clinical Trial Coordinator 

A clinical trial coordinator is the person managing the proceedings of the 
clinical trial. This job includes coordinating patient schedules and 
appointments, maintaining detailed notes, building the database to track 
patients (especially for long-term follow-up studies), ensuring proper 
documentation has been acquired and accepted, and working with the 
nurses and doctors to facilitate the trial and publication of the results. A 
clinical trial coordinator may have a science background, like a nursing 
degree, or other certification. People who have worked in science labs or in 
clinical offices are also qualified to become a clinical trial coordinator. 
These jobs are generally in hospitals; however, some clinics and doctor’s 
offices also conduct clinical trials and may hire a coordinator. 


Section Summary 


Cancer can be described as a disease of altered gene expression. Changes at 
every level of eukaryotic gene expression can be detected in some form of 
cancer at some point in time. In order to understand how changes to gene 
expression can cause cancer, it is critical to understand how each stage of 
gene regulation works in normal cells. By understanding the mechanisms of 
control in normal, non-diseased cells, it will be easier for scientists to 
understand what goes wrong in disease states including complex ones like 
cancer. 


Glossary 


DNA methylation 
epigenetic modification that leads to gene silencing; commonly found 
in cancer cells 


histone acetylation 
epigenetic modification that loosens nucleosome packing and allows gene 
expression; its loss (histone deacetylation) leads to gene silencing and is 
commonly found in cancer cells 


myc 
oncogene that causes cancer in many cancer cells 


